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Abstract 

A word w G Ffc, the free group on k generators, is called primitive if it 
belongs to some basis of Ffc. Associated with w and a finite group G is the 
word map w : G x . . . x G — > G defined on the direct product of k copies of 
G. We call w measure preserving if given uniform measure on G x . . . x G, 
the image of this word map induces uniform measure on G (for every finite 
group G). It is easy to see that every primitive word is measure preserving, 
and several authors have conjectured that the two properties are, in fact, 
equivalent. Here we prove this conjecture. The main ingredients of the 
proof include random coverings of Stallings graphs, algebraic extensions 
of free groups and Mobius inversions. Our methods yield the stronger 
result that a subgroup of is measure preserving iff it is a free factor. 

As an interesting corollary of this result we resolve a question on the 
profinite topology of free groups and show that the primitive elements of 
Ffc form a closed set in this topology. 

1 Introduction 

Let F fc be the free group on k generators X = {xi t . . . , Xk}, and let w 6 Ffc 
be expressed in the letters of the basis X, as io = nj=i x i 3 ' where Ej = ±1. 
Associated with w is a word map: for every group G, this map, which we also 
denote by w, sends G X G X • • • X G to G by (<7i, . . . , gk) 11^=1 9? ■ Alterna- 

k 

tively, we think of the word map w as the evaluation map from Horn (Ffc, G) to 
G, i.e., w (a) — a (w) for a € Horn (Ffc, G). The identification of Horn (Ffc, G) 
with G k is due to the fact that a homomorphism from a free group is uniquely 
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determined by choosing the images of the elements of a basis, and these images 
can be chosen arbitrarily. 

The last years have seen a great interest in word maps in groups, and research 
around them has been very active (see, for instance, |Sha09j . |LS09| ; for a recent 
book on the topic see |Seg09| ). Our focus here is on the property of measure 
preservation: for every finite group G, we consider G x . . . x G as being equipped 
with the uniform probability distribution. Then, every w S induces some 
measure on G via the associated word map. We say that the word w preserves 
measure with respect to G if this induced measure is again uniform. (In other 
words, if all fibers of the word map have the same size). We say that w is 
measure preserving if it preserves measure with respect to every finite group G. 

Measure preservation can be equivalently defined in the language of homo- 
morphisms: w is measure preserving iff for every finite group G, the element 
a<3 (w) is uniformly distributed over G, where ag G Horn (F& , G) is a homo- 
morphism chosen uniformly at random. 

This concept was investigated in several recent works. See for example [LS08J 
and |GS09) , where certain word maps are shown to be almost measure preserv- 
ing, in the sense that the distribution induced by w on finite simple groups G 
tends to uniform as |G| — > oo. 

One finds in the literature an even stronger notion of measure preservation 
on a word w, where the image of w is considered over compact groups G w.r.t. 
their Haar measure. Our results make use only of the weaker condition that 
involves solely finite groups. 

The notion of measure preservation has a strong connection to primitivity. 
An element to of a free group J is called primitive if it belongs to some basis 
(free generating set) of J. When J is given with a basis X, this is equivalent 
to the existence of an automorphism of J which sends w to a given element of 
X. It is an easy observation that primitive words are measure preserving. The 
reason is that as mentioned, a homomorphism ag £ Horn (F&, G) is determined 
by the images of the elements of a basis of F& , which can be chosen arbitrarily 
and independently. 

Several authors have conjectured that the converse is also true. Namely, that 
measure preservation implies primitivitjQ. From private conversations we know 
that this has occurred to the following mathematicians and discussed among 
themselves: T. Gelander, M. Larsen, A. Lubotzky and A. Shalev. The question 
was mentioned several times in the Einstein Institute Algebra Seminar, and was 
independently raised in |AV11| and also in |LP10| . 

In [ Pudllj we proved the conjecture for F2. Here we prove it in full. A 
key ingredient of the proof is the extension of the problem from single words to 
(finitely generated) subgroups of F^. The concept of primitive words extends 
naturally to the notion of free factors. Let H be a subgroup of the free group 
J (in particular, H is free as well). We say that H is a free factor of J, and 

J It is interesting to note that there is an easy abelian parallel to this conjecture. A word 
w £ Ffe belongs to a basis of Z, k = F^/FJ. iff for any group G the associated word map is 
surjective. See [Seg09| , Lemma 3.1.1. 
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denote this by H < J, if there is a subgroup W < J such that H * H 1 = J. 

* 

Equivalently, H < J iff some basis of H can be extended to a basis of J. (This 
in turn is easily seen to be equivalent to the condition that every basis of H 
extends to a basis of J.) 

The concept of measure preservation can also be extended from words in 
the free group to finitely generated subgroups (we write H < f g when H is a 
finitely generated subgroups of F^): 

Definition 1.1. Let H <f g Fj.. We say that H is measure preserving if for 
every finite group G and ag G Hom(F^,G) a random homomorphism chosen 
with uniform distribution, ctG| ff is uniformly distributed in Horn (if, G). 

In particular, 1 ^ w G Ffc is measure preserving iff (w) is measure preserving. 
As for single words, it is easy to see that if H <f g Fk is a free factor, then H 
is measure preserving. In |Pudll| we conjectured that the converse also holds, 
and proved this conjecture for subgroups of F^ of rank > k — 1 (thus proving 
the conjecture for F 2 ). Here we prove the extended conjecture in full: 

Theorem 1.2. For every w G F^, 

w is measure preserving w is primitive 

More generally, for H <f g Fk, 

H is measure preserving -<==^ H < Fk 

In Section [7] we explain how this circle of ideas is related to the study of 
profinite groups. In particular we have the following corollary (see also Corollary 
\7H : 

Corollary 1.3. The set P of primitive elements in Fk is closed in the profinite 
topology. 

In order to prove Theorem 1 1.2[ one needs to exhibit, for each non-primitive 
word w G Ffc, some "witness" finite group with respect to which w is not measure 
preserving. Our witnesses are always the symmetric groups S n . In fact, it is 
enough to restrict one's attention to the average number of fixed points in the 
random permutation as n (w) (which we also denote by a n {w)). We summarize 
this in the following stronger version of Theorem 11.21 

Theorem (|1.2l 'l. Letw G F^,, and for every finite group G , letotQ G Hom(Fk,G) 
denote a random homomorphism chosen with uniform distribution. Then the 
following are equivalent: 

(1) w is primitive. 

(2) w is measure preserving: for every finite group G the random element 
a G ( w ) has uniform distribution. 
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(3) For every n G N the random permutation a n (w) — as n {w) has uniform 
distribution. 

(4) For every neN, the expected number of fixed points in the random per- 
mutation a n (w) — as n (w) is 1: 

E[#fix(a„H)] = 1 

(5) For infinitely many n£N, 

E[#fix (a n (w))} < 1 

We already explained above the implication (1) =$> (2), and (2) => (3) => 
(4) =>• (5) is evident (recall that a uniformly distributed random permutation has 
exactly one fixed point on average). The only nontrivial, somewhat surprising, 
part is the implication (5) => (1) which is proven in this paper. 

With the proper adjustments, Theorem 1 1.21 applies to f.g. subgroups as well. 
For example, the parallel of property (4) for H <f g Ffc is that for every n, the 
image a n (H) C S n stabilizes on average exactly n 1-rk ( ff ) elements of {1, . . . , n} 
(where rk (H ) denotes the rank of H ) . As stated, it turns out that this property 
alone is enough to yield that H is a free factor of Ffc . 

A key role in the proof is played by the notion of primitivity rank, an invariant 
classifying words and f.g. subgroups of Ffc, which was first introduced in [Pudll] . 
A primitive word w £ F& is also primitive in every subgroup containing it (Claim 
12 . 8tf3|) ) . However, if w is not primitive in F^, it may be either primitive or non- 
primitive in subgroups of F^ containing it. But what is the smallest rank of a 
subgroup giving evidence to the imprimitivity of w7 Informally, how far does 
one have to search in order to establish that w is not primitive? Concretely: 



Definition 1.4. The primitivity rank of w 6 F^, denoted tt (w), is 
ir(w) = min < rk (J) 



w£j<F k s.t. 
w is not primitive in J I 



If no such J exists, i.e. if w is primitive, then n (w) — oo. A subgroup J for 
which the minimum is obtained is called w-critical. 

More generally, for H <f g F^ , the primitivity rank of H is 



7r (H) = min < rk (J) 



H < J < F fc s.t. 
H is not a free factor of J 



Again, if no such J exists, then tt (H) — oo, and a subgroup J for which the 
minimum is obtained is called 77-critical. 

Note that for w ^ 1, ir (w) = tt ({vj}). For instance, tt (w) — 1 if and only if 
w is a proper power (i.e. w — v d for some v G Ffc and d > 2). By Claim [2~8t [3)l . 
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Table 1: Primitivity Rank and Average Number of Fixed Points. 



in Ffc the primitivity rank takes values only in {0, 1, 2, . . . , k} U {oo} (the only 

* 

word w with tt (w) — is w = 1). Moreover, tt (H) = oo iff H < Ffc. Finally, 
Lemma 23 from |Pudll| yields that tt can take on every value in {o, . . . , k}E it 
is interesting to mention that tt (H) also generalizes the notion of compressed 
subgroups, as appears, e.g., in [MVW07J: a subgroup H <t g Ffc is compressed 
iff 7r (H) > rk (H). Finally, in Section[3]we show how tt (•) can be computed. 

In this paper we sometimes find it more convenient to deal with reduced 
ranks of subgroups: rk (H) d = rk(_ff) — 1. We therefore define analogously the 
reduced primitivity rank, tt (•) d = tt (•) — 1. 

As mentioned above, our main result follows from an analysis of the aver- 
age number of common fixed points of a n (H) (where a n denotes a uniformly 
distributed random homomorphism in Horn (Ffc, S n )). In other words, we count 
the number of elements in {1, . . . ,n} stabilized by the images under a n of all 
elements of H. Theorem 11.21 follows from the main result of this analysis: 

Theorem 1.5. For every H <f g Ffc, denote by Crit (H) the set of H- critical 
subgroups o/Ffc. The average number of common fixed points of a n (H) is 

1 , | Crit (H) | , Q ( 1 



n rk{H) n^W \n' K W+ 1 

In particular, the set of i7-critical subgroups Crit (H) is finite. Theorem 
11.51 is stated for subgroups, but it equally applies to single words: the av- 
erage number of fixed points of the random permutation a n (w) is the same 
as the average number of common fixed points of a n {(w)). This average is 
1 + | Crit (w) | • n-*^ + O (n - ^ -1 ), where Crit (w) denotes the set of to- 
critical subgroups. Table Q] summarizes the connection implied by Theorem 11.51 
between the primitivity rank of w and the average number of fixed points in the 
random permutation a n (to) . 

Theorem 11.51 implies the following general corollary regarding the family of 
distributions of S n induced by word maps: 



2 For example, it follows from this lemma that in Ffc , for every 1 < d < k, tt (iEj 2 . . . x j 
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Corollary 1.6. For an imprimitive w G F fc the average number of fixed points 
in a n (w) is strictly greater than 1, for large enough n. 

Corollary 11.61 is in fact the missing piece (5) (1) in Theorem 11.21 . In 
addition, it follows from this corollary that for every w G Fj. and large enough 
n, the average number of fixed points in a n (w) is at least on^f|. In other words, 
primitive words generically induce a distribution of S n with the fewest fixed 
points on average. 

The results stated above validate completely the conjectural picture de- 
scribed in [Pudllj. Theorem 11.51 and its consequences, Corollaries 17.11 [TTSI and 
11.61 are stated there as conjectures (Conjectures 7, 24 and 27). 

The analysis of the average number of fixed points in a n (w) has its roots in 
|Nic94j . Nica notices that by studying the various quotients of a labeled cycle- 
graph (corresponding to u;), one can compute a rational expression in n which 
gives this average for every large enough n. When w — u d with d maximal 
(so u is not a power), he shows that the limit distribution of the number of 
fixed points in a n (w) (as n —> oo) depends solely on His results yield that 
the average number of fixed points in a n (w) is 5 (d) + O (i) where S (d) is the 
number of (positive) divisors of d ( |Nic 94j . Corollary 1.3). This is equivalent to 
Theorem 11.51 for words which are powers, namely, for w G with ir (w) = 1 
(see Table [lj. Indeed, if w — u d as above with d > 1, then 

Crit(io) = { (u m ) \l<m<d,m\d}, 

so |CritH \ = 6(d)-l. 

The first steps in extending these results to non-power words were made 
in [LP10], where the case n (w) = 2 was partially handled. The statement of 
Theorem 11.51 for this case was completely proven in [Pudllj . In [Pudllj the 
analysis was also extended to f.g. subgroups, and the statement of Theorem 1 1.5 1 
was more generally proven for every H <f g Fk with rk (H) > k — 1. However, 
the technique used in these two papers is specialized for the proven cases, and 
so far could not be generalized to yield the general result. 

In the current paper, better analysis of the average number of fixed points in 
a n (w) is achieved by studying a broader question: for every pair of H , J < f g F^ 
such that H < J, we define for n G N 

<5>h,j (n) = The expected number of common fixed points of aj iU (H), (1-1) 

where aj, n G Horn (J, S n ) is a random homomorphism chosen with uniform dis- 
tribution. The original function studied by Nica is the special case &( w ) t F k (n). 

In addition, the current paper uses much more extensively various combina- 
torial structures of posets (partially order sets) on the set of f.g. subgroups of 

3 It is suggestive to ask whether this holds for all n. Namely, is it true that for every 
w £ Ffc and every n, the average number of fixed points in a n (w) is at least 1? By results of 
Abert ( IAbe06 |), this statement turns out to be false. 

4 Nica's result is in fact more general: the same statement holds not only for fixed points 
but for cycles of length L for every fixed L. 
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Ffc. This set has, of course, a natural structure of a lattic^l given by the relation 
of inclusion. However, there are other interesting partial orders defined on this 
set, some of which are much sparser and have the advantage of being locally 
finit^E If ^ is some partial order on the subgroups of F/., and H, J < F&, we 
define the closed interval 

[H, J] x = {L < F k | H ^ L d J} 

and similarly the open interval (H, J)^ = {L < F^ | H ^ L ^ J}, the half- 
bounded interval [H, oo)^, = {L <Fk\H d: L}, and so on (see also the glos- 
sary). 

The sparsest partial order we consider here is that of Algebraic Extensions. 
This notion was introduced in [KM02] and further studied in [MVW 07J. 

Definition 1.7. Let H, J be free groups. We say that J is an algebraic extension 
of H , denoted H < a ig J, i£ H < J and H is not contained in any proper free 
factor of J. 

The terminology comes from similarities (that go only to some extent) be- 
tween this notion and that of algebraic extensions of fields (in this line of 

thought, J is a transcendental extension of H when H < J). 

The relation of algebraic extension turns out to be a partial order on the 
set of subgroups of F^: it is clearly reflexive and antisymmetric, but it is also 
transitive (Claim [XT1) . In addition, it is very sparse: it turns out that [H, oo) alg , 
the set of algebraic extensions of H, is finite for every H <f g F&, so in particular 
this partial order is locally finite on the set of f.g. subgroups of F^. (These 
claims appear in [MV W07| . We repeat some of the arguments in Section [3]). 
It is a simple observation that ff-critical subgroups are in particular algebraic 
extensions of H, i.e. Crit (H) C [H, oo) al . In fact, they are the proper algebraic 
extensions of minimal rank (see Section |3]) . 

Between the rather dense partial order of inclusion and the very sparse one of 
algebraic extensions, lies an infinite family of intermediate partial orders, one for 
every choice of basis for Ffe. Although the dependency on the basis makes these 
orders somewhat less universal, they turn out to be extremely useful. These 
partial orders are based on the notion of quotients, or surjective morphisms, of 
core graphs. Introduced in [Sta83 , core graphs provide a geometric approach to 
the study of free groups (for an extensive survey see |KM 02l. and also [MVW07] 
and the references therein). Fix a basis X — {xi, . . . ,Xk} of Fj.. Associated 
with every H < Fj, is a directed, pointed graph denoted Fx (H), whose edges 
are labeled by the elements of the basis X. A full definition appears in Section 
[21 but we illustrate the concept in Figure 11.11 It shows the core graph of the 
subgroup of F2 generated by xix^xi and x^ 2 X2- 

The aforementioned relation holds for the pair H, J iff the associated core 
graph Fx (J) is a quotient (as a pointed labeled graph) of the core graph Fx (H) 

5 By a lattice we refer here to a partially ordered set in which every two elements have 
unique infimum and supremum. 

6 A locally finite poset is one in which every closed interval [a, b] = {x : a < x < b} is finite. 
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Figure 1.1: The core graph Tx (H) 
where H = (xxx^ x\,x^ x%) < F2. 




(see Definition 123]) . We denote this by H <^ J. When H < fg F fe , T x (H) is 
finite (Claim liOlf Tjl). and thus has only finitely many quotients. Furthermore, 
it turns out that different groups correspond to different core graphs. Thus, this 
partial order, restricted to f.g. subgroups, is too locally finite. 

As claimed, these partial orders are indeed intermediate: if H < a ig J then 
H <-» J for every basis X, and if H J then H < J. (These claims appear 
in |MVW07| . and also here in Sections [2] and [3). In |Pudll| . the poset of 
f.g. subgroups of Ffe w.r.t. "<^" was studied, and a distance function denoted 
px (H,J) was introduced for H , J <f g F k such that H <^> J (Definition 12.61 
in the current paper). The following theorem ( |Pudll| . Theorem 1) is a key 
ingredient in the proof of Theorem 11.51 

Theorem 1.8. Let H,J <f g Ffc and assume further that H <-» J. Then 
H < J if and only if 

Px (H, J) = rk (J) — rk (H) 

The local-finiteness of the partial orders "< a ; g " and allows us to use 

machinery such as Mobius inversion on functions defined on the set of compara- 
ble pairs of f.g. subgroups. Indeed, we use this to "derive" the function $ (recall 
(| 1 . 1 [) ) and obtain its "right derivation" R x , its "left derivation" L x , and its "two 
sided derivation" C x (see Section [4]). For instance, &h,j can be presented as 
finite sums of R x : 

$HJ = ]T R^ M 

AfG[J?,J]_» 



M 



H <- M <^ J 



(here [H, J]-» is an abbreviation for [H, J] < , i.e. [H, J]-» = < 1 

The proof of Theorem 11.51 (yielding our main result) is based on the following 
results: 



(Proposition 14. ip The right derivation R x is supported on algebraic ex- 
tensions, i.e. if H M but M is not an algebraic extension of H then 

• (The discussion in Section[S]) The random homomorphism S Horn (J, S n ) 
can be encoded as a random covering-space of the core graph Tx (J), and 
&h,j i n ) can then be interpreted as the expected number of lifts of Tx (H). 
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• f Lemmas 15.31 and 15 .41) The left derivation L x is then the expected number 
of injective lifts of the core graph Tx (H) into the random covering of the 
core graph Tx (J), and a rational expression can be computed for j. 

• (Proposition 16.11 and Section I6.1[) An analysis involving Stirling numbers 
of the rational expressions for L x yields a combinatorial meaning for the 
two-sided derivation C x . Using Theorem 11.81 and the classification of 
primitivity rank we obtain a first-order estimate for the size of j. 

• (Proposition 16. 2p From C x we return to R x (by "left-integration") , ob- 
taining that whenever H < a ig M we have 



and by right integration of R , we obtain the order of magnitude of $ , 
which was our goal. 

The paper is arranged as follows: in Section [2] the notion of core graphs is 
explained in details, as well as the partial order <-», and the background for 
Theorem ll.81 In Section[3]we survey the main properties of algebraic extensions 
of free groups. Section U is devoted to recalling Mobius derivations on locally- 
finite posets and introducing the different derivations of $. In Section [5] we 
discuss the connection of the problem to random coverings of graphs and analyze 
the left derivation L x . The proof of Theorem 11.51 is completed in Section [6] via 
the analysis of the two-sided derivation C x and the consequence of the latter on 
the right derivation R x . Finally, corollaries of our results to the field of profinite 
groups, and to decidability questions in group theory, are discussed in Section 
[7J We finish with a list of open problems naturally arising from this paper. For 
the reader's convenience, there is also a glossary of notions and notations at the 
end of this manuscript. 



Fix a basis X — {xi, . . . , Xk] of F^,. Associated with every subgroup H < Fj, is 
a directed, pointed graph whose edges are labeled by X. This graph is called the 
core-graph associated with H and is denoted by Tx (H). We recall the notion 
of the Schreier (right) coset graph of H with respect to the basis X, denoted by 
Tx (H)- This is a directed, pointed and edge- labeled graph. Its vertex set is the 
set of all right cosets of H in Fj,, where the basepoint corresponds to the trivial 
coset H. For every coset Hw and every basis-element Xj there is a directed 
j-edge (short for iEj-edge) going from the vertex Hw to the vertex 77^x^0 

Alternatively, Tx (H) is the quotient H\T, where T is the Cayley graph of with 
respect to the basis X, and (and thus also H) acts on this graph from the left. Moreover, 
this is the covering-space of Fx (Fk) = Tx (^fc), the bouquet of k loops, corresponding to H, 
via the correspondence between pointed covering spaces of a space Y and subgroups of its 
fundamental group 7ri (Y). 




2 Core Graphs and the Partial Order "< 



X 
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The core graph Tx (H) is obtained from Tx (H) by omitting all the ver- 
tices and edges of Tx {H) which are not traced by any reduced (i.e., non- 
backtracking) path that starts and ends at the basepoint. Stated informally, we 
trim all "hanging trees" from Fx (H). Formally, Fx (H) is the induced subgraph 
of Tx (H) whose vertices are all cosets Hw (with w reduced) , such that for some 
word w' the concatenation ww' is reduced, and w-w' G H. To illustrate, Figure 
12.11 shows the graphs T x (H) and T x (H) for H = (xix 2 x^ 3 , XiX 2 x^ 2 ) < F 2 . 
Note that the graph Tx (H) is 2fc-regular: every vertex has exactly one outgoing 
j'-edge and one incoming j-edge, for every 1 < j < k. Every vertex of Tx (H) 
has at most one outgoing j-edge and at most one incoming j-edge, for every 
1 < j < k. 




Figure 2.1: T x (H) and T X {H) for H = { Xl x 2 x^ ,x?x 2 x^ 2 ) < F 2 . The 
Schreier coset graph Tx (H) is the infinite graph on the left (the dotted lines 
represent infinite 4-regular trees). The basepoint "(g)" corresponds to the trivial 
coset H, the vertex below it corresponds to the coset Hxi, the one further down 
corresponds to Hx£* — Rx\x 2 x^ x , etc. The core graph Tx (H) is the finite 
graph on the right, which is obtained from Tx (H) by omitting all vertices and 
edges that are not traced by reduced closed paths around the basepoint. 

If r is a directed pointed graph labeled by some set X, paths in T correspond 
to words in F (X) (the free group generated by X). For instance, the path (from 
left to right) 

X2 X2 Xi X2 X3 X2 X\ 
• 9- • S- • 9- • -< • 9- • 9- • -t • 

corresponds to the word x^xxx^x-sXix^ 1 ■ The set of all words obtained from 
closed paths around the basepoint in T is a subgroup of F (X) which we call the 
labeled fundamental group of T, and denote by tt^ (T). Note that tt* (T) need 
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not be isomorphic to m (T) , the standard fundamental group of T viewed as a 

topological space - for example, take T — x i (^J®^) Xl ■ 

However, it is not hard to show that when T is a core graph, then tt* (T) is 
isomorphic to m (T) (e.g. |MVW07] ). In this case the labeling gives a canonical 
identification of tti (r) as a subgroup of F (X). It is an easy observation that 

Trf (T x (H)) = Trf (T X (H)) = H (2.1) 

This gives a one-to-one correspondence between subgroups of F (X) = Ffc and 
core graphs labeled by X. Namely, ir* and Fx are the inverses of each other in 
a bijection (Galois correspondence) 



Subgroups 1 J Core graphs 1 

ofF(Jf) J t^x { labeled by X J 



(2.2) 



Core graphs were introduced by Stallings [Sta83 . Our definition is slightly dif- 
ferent, and closer to the one in (KM02, MVW07] in that we allow the basepoint 
to be of degree one, and in that our graphs are directed and edge-labeled. (This 
can be easily introduced into Stallings' framework, see |Pudll| ). We remark 
that it is possible to study core graphs from a purely combinatorial point of 
view, as labeled pointed connected graphs satisfying 

(1) No two equally labeled edges originate or terminate at the same vertex. 

(2) Every vertex and edge are traced by some non-backtracking closed path 
around the basepoint. 

Starting with this definition, every choice of an ordered basis for Ffc then gives 
a correspondence between these graphs and subgroups of Ffc . 

In this paper we are mainly interested in finite core graphs, and we now list 
some basic properties of these (proofs can be found in |Sta83 ( KM02. MVW07J). 

Claim 2.1. Let H be a subgroup of Ffc with an associated core graph T — 
Tx (H). The Euler Characteristic of a graph, denoted x(-), is the number of 
vertices minus the number of edges. 

(1) rk (H) < oo <==^ r is finite. 

(2) vk(H) = - X (T). 

(3) The correspondence i2.2\) restricts to a correspondence between finitely 
generated subgroups of Ffc and finite core graphs. 

Given a finite set of words {hi, ■ ■ ■ , h m } C F (X) that generate a subgroup 
H, the core graph Fx (H) can be algorithmically constructed as follows. Every 
hi corresponds to some path with directed edges labeled by the Xj 's (we assume 
the elements are given in reduced forms, otherwise we might need to prune leaves 
at the end of the algorithm) . Merge these m paths to a single graph (bouquet) 
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by identifying all their 2m end-points to a single vertex, which is marked as the 
basepoint. The labeled fundamental group of this graph is clearly H . Then, as 
long as there are two j-labeled edges with the same terminus (resp. origin) for 
some j, merge the two edges and their origins (resp. termini). Such a step is 
often referred to as Stallings folding. It is fairly easy to see that each folding step 
does not change the labeled fundamental group of the graph, that the resulting 
graph is indeed Tx (H), and that the order of folding has no significance. To 
illustrate, we draw in Figure |2~21 a folding process by which we obtain the core 
graph Tx (H) of H = (xiX2X^ , x^x^x^ ) < F 2 from the given generating set. 




2 



Figure 2.2: Constructing the core graph Tx (H) of H = {xiX2Xi 3 , x^X2X^ 2 ) < 
F2 from the given generating set. We start with the upper left graph which 
contains a distinct loop at the basepoint for each (reduced) element of the 
generating set. Then, at an arbitrary order, we merge pairs of equally-labeled 
edges which share the same origin or the same terminus (here we mark by triple 
arrows the pair of edges being merged next). The graph at the bottom right is 
Tx (H), as it has no equally-labeled edges sharing the same origin or terminus. 

A morphism between two core-graphs is a map that sends vertices to vertices 
and edges to edges, and preserves the structure of the core graphs. Namely, it 
preserves the incidence relations, sends the basepoint to the basepoint, and 
preserves the directions and labels of the edges. 

As in Claim 12.11 each of the following properties is either proven in (some 
of) |Sta83l IKM021 IMVW07] or an easy observation: 

Claim 2.2. Let H,J,L< Fk be subgroups. Then 

(1) A morphism Tx (H) — > Tx (J) exists if and only if H < J. 

(2) If a morphism Tx (H) — > Tx (J) exists, it is unique. We denote it by 

(3) Whenever H < L < J, T}§^j = r/£^ 7 o n£^ L B 

8 Points (l)-(3) can be formulated by saying that A2.2I) is in fact an isomorphism of cate- 
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(4) If Vh->j * s infective, then H < J[J 

(5) Every morphism in an immersion (locally injective at the vertices). 

A special role is played by surjective morphisms of core graphs: 

Definition 2.3. Let H < J < Ffc. Whenever n^^j is surjective, we say that 
Tx (H) covers Tx (J) or that Tx (J) is a quotient of Fx {H). We indicate this 
by Tx (H) -» Tx (J). As for the groups, we say that H X -covers J and denote 
this by H J. 

By "surjective" we mean surjective on both vertices and edges. Note that we 
use the term "covers" even though in general this is not a topological covering 
map (a morphism between core graphs is always locally injective at the vertices, 
but it need not be locally bijective) . In Section [5] we do study topological 
covering maps, and we reserve the term "coverings" for these. 

For instance, H = {x\X2X^ i ,x^X2X^' 1 ) < F& X-covers the group J = 
(x2, x 1 , X1X2X1), the corresponding core graphs of which are the leftmost and 
rightmost graphs in Figure 12.31 As another example, a core graph T X-covers 
Tx (Ffc) (which is merely a wedge of k loops) if and only if it contains edges of 
all k labels. 

As implied by the notation, the relation H J indeed depends on the 
given basis X of Ffe. For example, if H — (X1X2) then H F2. However, for 
Y = {xiX2,X2}, H does not Y-cover F2, as Ty (H) consists of a single vertex 
and a single loop and has no quotients apart from itself. 

It is easy to see that the relation "<^" indeed constitutes a partial ordering 
of the set of subgroups of Ffc. We make a few other useful observations: 

Claim 2.4. Let H,J,L< Ffc be subgroups. Then 

(1) Whenever H < J there exists an intermediate subgroup M such that H <^> 
M < J. 

(2) If one adds the condition that Tx (M) embeds in Tx (J), then this M is 
unique. 

(3) If H <^ J and H <^ L < J, then L <^ J. 

(4) If H is finitely generated then it X -covers only a finite number of groups. 
In particular, the partial order "<^> " restricted to f.g. subgroups of Ffc is 
locally finite. 

Proof. Point (1) follows from the factorization of the morphism n^^j to a 
surjection followed by an embedding. Indeed, it is easy to see that the image 
of Vh^j i s a sub-graph of Tx (J) which is in itself a core graph. Namely, it 
contains no "hanging trees" (edges and vertices not traced by reduced paths 

gories, given by the functors it* and Tx- 

9 But not vice-versa: for example, consider (xix^ ) < F2. 
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around the basepoint). Let M — ir* [rmrig^j) be the subgroup corresponding 
to this sub-core-graph. (1) now follows from points (1) and (4) in Claim [2~2l 
Point (2) follows from the uniqueness of such factorization of a morphism. Point 
(3) follows from the fact that if tj^^j = rj^.j o rj^^ L is surjective then so is 
T)l^j. Point (4) follows from the fact that T x (H) is finite (Claim 0[T])) and 
thus has only finitely many quotients, and each quotient correspond to a single 
group (by (123)). □ 

Following [MVW07J, we call the set of X-quotients of H, namely, 

[H,oo) 1 = {j\H< 1 j} (2.3) 

the X -fringe of H . Claim |2~4"1J 4|) states in this terminology that for every H <f g 
Ffc (and every basis X), [H,oo)^ < oo. Note that [H, oo)^> always contains 
the supremum of its elements, namely the group generated by the elements of 
X which label edges in Tx (H) (which is tti (im?yj|^ F )). 

It is easy to see that quotients of (H) are determined by the partition 
they induce of the vertex set V (Tx {H)). However, not every partition P of 
V (Tx (H)) corresponds to a quotient core- graph: in the resulting graph, which 
we denote by r A-(-ff)/p, two distinct j-edges may have the same origin or the 
same terminus. Then again, when a partition P of V (Tx (H)) yields a quotient 
which is not a core-graph, we can perform Stallings foldings (as demonstrated 
in Figure l2~2f until we obtain a core graph. Since Stallings foldings do not affect 
it* , the core graph we obtain in this manner is Tx (J), where J = tt* ( r x(ff)/p). 
The resulting partition P of V (Tx (H)) (as the fibers of ry^^j) is the finest 
partition of V (Tx (H)) which gives a quotient core- graph and which is still 
coarser than P. We illustrate this in Figure |2~51 

Thus, there is sense in examining the quotient of a core graph T "generated" 
by some partition P of its vertex set, namely, Tx ( r / p ))- The most in- 
teresting case is that of the "simplest" partitions: those which identify only a 
single pair of vertices. Before looking at these, we introduce a measure for the 
complexity of partitions: if P C 2 X is a partition of some set X, let 

\\P\\ d = , \X\-\P\=J2 (\B\-1). (2.4) 

BeP 

Namely, ||P|| is the number of elements in the set minus the number of blocks in 
the partition. For example, ||P|| = 1 iff P identifies only a single pair of elements. 
It is not hard to see that ||P|| is also the minimal number of identifications one 
needs to make in X in order to obtain the equivalence relation P. 

Definition 2.5. Let T be a core graph and let P be a partition of V (T) with 
||P|| = 1, i.e. having a single non-trivial block of size two. Let A be the core 
graph generated from T by P. We then say that A is an immediate quotient of 

r. 
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Figure 2.3: The left graph is the core graph Tx (H) of H = 
(xxXiX^ ,xlx%Xi ) < F 2 . Its vertices are denoted by vi,...,v^. The 
graph in the middle is the quotient r x(H)/ P corresponding to the partition 
P = {{vi,Vi} , {V2} , {v^}}. This is not a core graph as there are two 1- 
edges originating at {1)1,114}. In order to obtain a core quotient-graph, we 
use the Stallings folding process (illustrated in Figure l2~2l . The resulting core 
graph, Fx {it* ( rx ^V^)), is shown on the right and corresponds to the parti- 
tion P = {{vi,Vi} , {v 2 , wall- 
Alternatively, we say that A is generated by identifying a single pair of ver- 
tices of T. For instance, the rightmost core graph in Figure l2~3l is an immediate 
quotient of the leftmost one. 

The main reason that immediate quotients are interesting is their algebraic 
significance. Let iJ, J < F& with T = Fx (H) , A = Fx (J) their core graphs, 
and assume that A is an immediate quotient of F obtained by identifying the 
vertices u, v G V (F). Now let w u , w v £ Ffe be the words corresponding to some 
paths PmPv in T from the basepoint to u and v respectively (note that these 
paths are not unique). It is not hard to see that identifying u and v has the 
same effect as adding the word w = WuW^ 1 to H and considering the generated 
group. Namely, that J = (H,w). 




The relation of immediate quotients gives the set of finite core graphs (with 
edges labeled by 1, . . . , k) the structure of a directed acyclic graph (IMG0- 

10 that is, a directed graph with no directed cycles. 
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This DAG was first introduced in |Pudll| . and is denoted by T>k- The set 
of vertices of T>k consists of the aforementioned core graphs, and its directed 
edges connect every core graph to its immediate quotients. Every ordered basis 
X = {x\, . . . ,Xk} of Ffe determines a one-to-one correspondence between the 
vertices of this graph and all finitely generated subgroups of F&. 

It is easy to see that in the case of finite core graphs, A is a quotient of 
r if and only if A is reachable from F in Z\. (that is, there is a directed path 
from r to A). In other words, ii H <f g then H <-. J iff Fx (J) can be 
obtained from Fx (H) by a finite sequence of immediate quotients. Thus, for 
any H <f g Ffe, the subgraph of T>k induced by the descendants of Fx (if) 
consists of all quotients of Fx (if), i.e. of all (core graphs corresponding to) 
elements of the X-fringe [ff, oo)-.. By Claim |2"T4"1 |4")) . this subgraph is finite. 
In Figure 12.41 we draw the subgraph of T>k consisting of all quotients of Fx (if) 
when H — (x\X2X^ 1 x^ 1 ) ■ The edges of this subgraph (i.e. immediate quotients) 
are denoted by the dashed arrows in the figure. 

® 1 — > • 

2 2 



• > • 




Figure 2.4: The subgraph of T>k induced by \H,oo)-~, that is, all quotients 

of the core graph F = Fx (if), for if = {X1X2X1 1 x^ 1 ) ■ The dashed arrows 
denote immediate quotients, i.e. quotients generated by merging a single pair 
of vertices. F has exactly seven quotients: itself, four immediate quotients, and 
two quotients at distance 2. 

It is now natural to define a distance function between a finite core graph 
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and each of its quotients: 

Definition 2.6. Let H,J <f g be subgroups such that H <^> J, and let 

T = Tx {H), A = Fx (J) be the corresponding core graphs. We define the 
X-distance between H and J, denoted px (H, J) or p (L, A), to be the shortest 
length of a directed path from T to A in T>y.. 

In other words, px (H, J) is the length of the shortest series of immediate 
quotients that yields A from T. There is another useful equivalent definition for 
the X-distance. To see this, assume that T' is generated from T by the partition 
P of V (r) and let r\ : T -» V be the morphism. For every x,y £ V (V), let 
x' 6 rf 1 (x) , y' S r]" 1 (y) be arbitrary vertices in the fibers, and let P' be the 
partition of V (T) obtained from P by identifying x' and y'. It is easy to see 
that the core graph generated from T' by identifying x and y is the same as the 
one generated by P' from T. From these considerations we obtain that 

P is a partition of V (T x (#)) \ (ty 
such that Trf (r x (H)/ P ) = J J ' 

For example, if A is an immediate quotient of T then px (H, J) — p (T, A) = 
1. For H = {x\X2Xi l x^ 1 ) , Tx (i?) has four quotients at distance 1 and two at 
distance 2 (see Figure [2~H> . 

As aforementioned, by merging a single pair of vertices of Tx (H) (and then 
folding) we obtain the core graph of a subgroup J obtained from H by adding 
some single generator (thought not every element of can be added in this 
manner). Thus, by taking an immediate quotient, the rank of the associated 
subgroup increases at most by 1 (in fact, it may also stay unchanged or even 
decrease). This implies that whenever H <^> J, one has 

rk (J) - rk (H) < p x (H, J) (2.6) 

In |Pudll) (Lemma 11), the distance is bounded from above as well: 

Claim 2.7. Let H,J< fg F k such that H <^ J. Then 

rk (J) - rk (H) < p x {H, J) < rk (J) 

Theorem 11.81 (the proof of which can be found in |Pudll| ) then asserts that 
the lower bound is attained if and only if H is a free factor of J: 

px (H, J) = rk (J) -rk(H) H <J 

In fact, one of the implications of Theorem [L8] is trivial. As mentioned above, 
merging two vertices in Tx (H) translates to adding some generator to H. If it 
is possible to obtain Tx (J) from Tx (H) by rk (J) — rk (H) merging steps, this 
means we can obtain J from H by adding rk (J) — rk (H) extra generators to 

H, hence clearly H < jF^I 

11 This relies on the well known fact that a set of size k which generates is a basis. 



px (H, J) = min 
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The other implication is not trivial and constitutes the essence of the proof 

of Theorem 1 in [Pudllj . The difficulty is that when H <-. J, it is not a priori 
obvious why it is possible to find rk (J) — rk (H) complementing generators of 
J from H, so that each of them can be realized by merging a pair of vertices 
in Tx (H). As mentioned, this theorem is heavily relied upon in the proof of 
Theorem 11.51 

We finish this section with some classical facts about free factors that will 
be useful in the next section. We also give an easy graph-theoretical proof to 
exemplify the strength of core graphs. 

Claim 2.8. Let H, J and K be subgroups of F&. 

(1) If H < J and K < J, then H n K < K . 

(2) IfH,K<J then Hf)K <J. 

(3) If H < J then H is a free factor of any intermediate group H < M < J. 

Proof. Let Y be a basis of J extending a basis Yo of H . Then Ty (J) and Ty (H) 
are bouquets of \Y\, \ Yq\ loops, respectively. 

(1) It is easy to check that Ty (H n K) is obtained from Ty (K) as follows: 
first, delete the edges labeled by Y \ Yq\ then, keep only the connected 
component of the basepoint; finally, trim all "hanging trees" (see the proof 
of Claim Consequently, Ty (H n K) is embedded in Ty (K) and by 

Claim [231 (4), HnK < K. 

(2) This follows from (1) by the transitivity of free factorness. 

(3) Let H < M < J. Since Ty (H) is a bouquet, the morphism t]Jj^ m must 
be injective, and we conclude again by Claim [2T2T 4'). 

□ 

In particular, the last claim shows that if H < Fk then tt (H) — oo (see 
Definition 1 1 .4[) . On the other hand, if H is not a free factor of Ffc, then obviously 
tt (H) < rk (F fe ) = k. Thus 7r (H) e {0, 1, 2, . . . , k} U {oo}. 

3 Algebraic Extensions and Critical Subgroups 

We now return to the sparest partial order we consider in this paper, that of 
algebraic extensions. All claims in this section appear in [KM02, M VW07) . 
unless specifically stated otherwise. We shall occasionally sketch some proofs in 
order to allow the reader to obtain better intuition and in order to exemplify 
the strength of core graphs. 

Recall (Definition 1 1.7p that J is an algebraic extension of H, denoted H < a ig 
J, if H < J and H is not contained in any proper free factor of J. For example, 
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consider H — (x\Xvx± x% / < F2. A proper free factor of F2 has rank at 
most 1, and H is not contained in any subgroup of rank 1 other than itself (as 
x\X2X^ 1 x^ 1 is not a proper power). Finally, H itself is not a free factor of F2 
(as can be inferred from Theorem 11.81 and Figure [33}. Thus, H < a ig F2. In 
fact, we shall see that in this case [H, oo) a[g — {H, F2}. 
We first show that "< a ig" is a partial order: 

Claim 3.1. The relation "< a i g " is transitive. 

Proof. Assume that H < aig M < aig J. Let H < L < J. By Claim HHEO) , 

LnM < M. But H < LDM and H < alg M, so LDM = M, and thus M < L. 

* 

So now M < L < J, and from M < a ig J we obtain that L = J. □ 

Next, we show that "< a ig" is dominated by for every basis X of Ffc. 

Namely, if H < a ig J then H <^> J. This shows, in particular, that the poset of 
finitely generated subgroups of F/. w.r.t. "< a ig" is locally-finite. 

Claim 3.2. If H < a ig J then H <-. J for every basis X o/Ffe. 

Proof. By Claim 12.41 there is an intermediate subgroup M such that H <— 

M < J, and from H < a ig J it follows that M = J. □ 

Remark 3.3. It is natural to conjecture that the converse also holds, namely 
that if H <- . J for ev ery basis X of F^ then H < a ig J- (In fact, this conjecture 
appears in [MVW07], Section 3.) This is, however, false. It turns out that for 
H = (xiX$) and J = (cc-j 2 a?| , X1X2), H < ? J for every basis A of F 2 , but 
J is not an algebraic extension of H |PP12f . However, there are bases of F3 
with respect to which H does not cover J. Hence, it is still plausible that some 
weaker version of the conjecture holds, e.g. that H < a ig J if and only if for 
every embedding of J in a free group F, and for every basis X of F, H <-» J. 
It is also plausible that the original conjecture from |MVW07] holds for Ffc with 
fc>3. 

In a similar fashion, one can ask whether H < J if and only if for some basis X 
of F k , H J. 

Claim 13.21 establishes completely the relations, mentioned in Section [IJ be- 
tween the different partial orders we consider in this paper: inclusion, the big 
family based on <-», and algebraic extensions. Recall that iJ-critical subgroups 
are a special kind of algebraic extensions. Thus: 

Crit(ff) C [H,oo) alg C [ff,oo) 5 C [H,oo)<. 

Theorem 11.81 and Claim [3~2l give the following criterion for algebraic extensions: 

Lemma 3.4. Let H <f g Fk- The algebraic extensions of H are the elements 
of [H,oo)-~ which are not immediate quotients of any subgroup in [H,oo)^ of 
smaller rank. 
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Proof. Let J S [77, oo)^>. If J is an immediate X-quotient of L G [i?, oo)-> 

with rk(L) < rk(J), then by Theorem 11.81 H < L ^ J, hence J is not an 
algebraic extension of H . On the other hand, assume there exists some L such 

that H < L < J. By Claim H2Q there exists M such that H <^ M < L $ J. 

* 

By Claim |2~4T 3) , M J. From Theorem 11.81 it follows that there is a chain 
of immediate quotients M — Mq < Mi < ... < M r — J inside [H, oo)-^ with 
rk (Mi + i) = rk (Mj) + 1, and M r _i is the group we have looked for. □ 

Since the subgraph of T>k induced by the vertices corresponding to [H, oo)^>, 
namely Tx (H) and its descendants, is finite and can be effectively computed, 
Lemma 13.41 yields a straight-forward algorithm to find all algebraic extensions 
of a given H <f g F& (this algorithm was first introduced in [Pudllj V This, 
in particular, allows one to find all ii-critical subgroups, and thus to compute 
the primitivity rank n (H) : the subgroups constituting Crit (H ) are those in 
(-ff, oo) alg of minimal rank, which is n (H). For instance, Figure [2T4l shows that 

for H = {x^x^x^ 1 ) we have H = {H,F 2 }. Thus, Crit (H) = {F 2 } and 
n(H) = 2 (so tt(H) = 1). 

We conclude this section with yet another elegant result from (KM02 , M VW07| 
that will be used in the proof of Theorem 11.51 In the spirit of field extensions, 
it says that every extension of subgroups of Fj, has a unique factorization to 
an algebraic extension followed by a free extension (compare this with Claim 
E3fl,2)): 

Claim 3.5. Let H < J be free groups. Then there is a unique subgroup L of 

J such that H < a ig L < J . Moreover, L is the intersection of all intermediate 
free factors of J and the union of all intermediate algebraic extensions of H : 

L= f) M = (J M (3.1) 

M:H<M<J M: H< alg M<J 

In particular, the intersection of all free factors is a free factor, and the union 
of all algebraic extensions is an algebraic extension. Claim [3~5l is true in general, 
but we describe the proof only of the slightly simpler case of finitely generated 
subgroups. We need only this case in this paper. 

Proof. By Claim 12.81 and rank considerations, the intersection in the middle of 

(|3.ip is by itself a free factor of J. Denote it by L, so we have H < L < J. 
Clearly, L is an algebraic extension of H (otherwise it would contain a proper 
free factor). But we claim that L contains every other intermediate algebraic 

extension of H. Indeed, let H < aig M < J. By ClaimHUnD, H < M n L < M, 
so M n L = M, that is M < L. □ 
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4 Mobius Inversions 



Let (P, <) be a locally-finite poset and let A be a commutative ring with 
unity. Then there exists an incidence algebr<B of all functions from pairs 
{(x,y) E P x P | x < y} to A. In addition to point-wise addition and scalar 
multiplication, it has an associative multiplication defined by convolution: 

(f*g)(x,y)= f( x , z )9(z,y) 

zE[x,y] 

(where x < y and [x,y] — {z\x < z < y}). The unit element is the diagonal 

S(x,y) 



1 x = y 
x < y 



Functions with invertible diagonal entries (i.e. / (x, x) £ A* for all x G P) are 
invertible w.r.t. this multiplication. Most famously, the constant £ function, 
which is defined by £ (x, y) = 1 for all x < y, is invertible, and its inverse, /i, is 
called the Mobius function of P. This means that £*/i = /i * £ = <5, i.e., for 
every pair x < y 

z£[x,y] ze[x,y] 

Let / be some function in the incidence algebra. The function / * C> which 
satisfy (/ * Q (y) = J2 z e[x y] f ( 2 ) ^ s analogous to the right-accumulating func- 
tion in calculus (for g : K — > E this is the function G (y) — f z( -^ xy ^g(z)dz). 
Thus, multiplying a function on the right by \x can be thought of as "right 
derivation". Similarly, one thinks of multiplying from the left by £ and ii as left 
integration and left derivation, respectively. 

Recall the function $ , defined for every pair of free subgroups H, J <f g 
Ffc such that H < J: &h,j (n) is the expected number of common fixed points 
of 

a J.n {H), where aj, n G Horn (J, S n ) is a random homomorphism chosen with 
uniform distribution. We think of $ as a function from the set of such pairs 
(H, J) into the ring of functions N — > Q. 

Let X be a basis of F^. We write $ x for the restriction of $ to pairs (H, J) 
such that H <^ J. As defines a locally finite partial ordering of the set 

of finitely generated subgroups of Fj, , there exists a matching Mobius function, 
fj, x = (C x ) 1 (where £# j = 1 for all H <_ J). Our proof of Theorem [L5l 



2 The theory of incidence algebra of posets can be found in ISta97l . 
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consists of a detailed analysis of the left, right, and two-sided derivations of $ 



x . 



x 



C x =' fi x * $ A " * ,i A " 
By definition, we have for every f.g. H < 7 J: 



Me[i?,J] 



E °at.\ 

M,N:H<^M<^N<^J 



x 



x 



- E 

Are[H,J]- 



"■H,N 



(4.1) 



Note that (|4.1[) can serve as definitions for the three functions L x , C A , R x : for 
instance, L x = [i x * <fr x is equivalent to C, x * L x = $ A , which is the leftmost 
equality above. 

We begin the analysis of these functions by the following striking observation 
regarding R x . Recall ( Claim |3?2|) that if H < a ig J then H <-» J for every basis 
X . It turns out that the function R x is supported on algebraic extensions alone, 
and moreover, is independent of the basis X. 

Proposition 4.1. Let H, J </ s Ffe. 

(1) If H J &it£ J is not cm algebraic extension of H , then R^ / = 0. 

(^) i?^ j = i?]^ j for every basis Y o/F^, whenever both are defined. 

Remark 4.2. The only property of $ we use is that $u,l = &h,j whenever 

H < L < J, which is easy to see from the definition of $. Therefore, the 
proposition holds for the right derivation of every function with this property. 
In particular, the proposition holds for every "statistical" function, in which the 
value of (H, J) depends solely on the image of H via a uniformly distributed 
random homomorphism from J to some group G. 



[H,J]-, 



the size of the 



Proof. We show both claims at once by induction on 
closed interval between H and J. The induction basis is H — J. That H < a ig H 
is immediate. By (14. ip . R^h = ^h.h and so R^ h 1S indeed independent of 
the basis X. 



Assume now that 



[H,J]-. 



— r and that both claims are proven for every 

pair bounding an interval of size < r. By (|4.ip and the first claim of the 
induction hypothesis, 



R H,J - ®H,J - 



E 



E 



R 



x 

H,N 



(4.2) 



N:H< alg N<^,J 
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By Claim [HP, |jV H < alg N <^ A = {N \ H < alg N < J}, and the lat- 
ter is independent of the basis X. Furthermore, by the induction hypothesis 
regarding the second claim, so are the terms N in this summation. This 
settles the second point. 

Finally, if J is not an algebraic extension of H then let L be some inter- 

* 

mediate free factor of J, H < L ^ J. As mentioned above, this yields that 
<&h,j = &h,l- Therefore, 



R H,J - ®H,J - ^ R H,N - ^H,L - R H,N ~ R 

N£[H,J)^ N£[H,L]^ Ne[H,J)^\[H,L 



X 

H.N 



by definition 

By Claim 13.51 all algebraic extensions of H inside the interval [H, J] -» are con- 
tained in L. Hence, every subgroup N G [H,J)~ \ [H,L}^> is not an algebraic 
extension of H, and by the induction hypothesis R^ N vanishes. The desired 
result follows. □ 

In view of Proposition ^. II we can omit the superscript and write from now on 
Rh.j instead of R H j. Moreover, we can write the following "basis independent" 
equation for every pair of f.g. subgroups H < J: 

N:H< alg N<J 

When H <^> J this follows from the proof above. For general H < J, there 

is some subgroup L such that H <^> L < J and every intermediate algebraic 
extension H < a ig N < J is contained in L (see Claims |2~H and |3~5"T) . Therefore, 

&h,j — &h,l — ^ Rh,n = ^ Rh.n- 

N: H< alg N<L N: H< a i g N<J 

It turns out that unlike the function R, the other two derivations of <I>, 
namely L x and C x , do depend on the basis X. However, the latter two func- 
tions have combinatorial interpretations. In the next section we show that <&h,J 
and j can be described in terms of random coverings of the core graph 
Fx (J), and that explicit rational expressions in n can be computed to express 
these two functions for given H, J (Lemmas 15.21 and 15.31 below). This, in turn, 
allows us to analyze the combinatorial meaning and order of magnitude of C x t N 
(Proposition 16. ip . 

Finally, using the fact that R is the "left integral" of C x , that is R = ( x *C X , 
we finish the circle around the diagram of $'s derivations, and use this analysis 
of L x and C x to prove that for every pair H < a ig J, Rh,j does not vanish 
and is, in fact, positive for large enough n. This alone gives Theorem II .21 The 
more informative 11.51 follows from an analysis of the order of magnitude of Rh. j 
in this case (Proposition! 
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5 Random Coverings of Core Graphs 



This section studies the graphs which cover a given core-graph in the topological 

^ v 

sense, i.e. r -» T with p locally bijective. We call these graphs (together with 
their projection maps) coverings of T. The reader should not confuse this with 
our notion "covers" from Definition 12.31 

We focus on directed and edge-labeled coverings. This means we only con- 

sider T -» T such that T is directed and edge-labeled, and the projection p 
preserves orientations and labels. When T is a core-graph we do not assume 
that r is a core-graph as well. It may be disconnected, and it need not be 
pointed. Nevertheless, it is not hard to see that when T and T are finite, for 
every vertex v in p~ x ((g)), the fiber over T's basepoint, we do have a valid core- 
graph, which we denote by r„: this is the connected component of v in T, with 
v serving as basepoint. Moreover, the restriction of the projection map p to T v 
is a core-graph morphism. 

The theory of core-graph coverings shares many similarities with the theory 
of topological covering spaces. The following claim lists some standard proper- 
ties of covering spaces, formulated for core-graphs. 

^ p 

Claim 5.1. Let T be a core-graph, Y -» T a covering and v a vertex in the fiber 
p- 1 (®). 

(1) The group tt^ (T) acts on the fiber p^ 1 (®), and these actions give a cor- 
respondence between coverings ofT and 7iy (T)-sets. 

(2) In this correspondence, coverings ofT with fiber {1, . . . , n} correspond to 
actions of tv* (T) on {1, . . . , n}, i.e., to group homomorphisms (T) — > 
S n • 

(3) The group (j^v^j is the stabilizer ofv in the action of (T) onp^ 1 (®) 
(note that tt^ i^v^j an d n i (T) are both subgroups o/F (X)). 

(4) A core-graph morphism A — > V can be lifted to a core- graph morphism 
A — » r„ (i.e., the diagram 




can be completed) if and only ifn* (A) C \J"vj ■ By the previous point, 
this is equivalent to saying that all elements of ' tt± (A) fix v. 

We now turn our attention to random coverings. The vertex set of an n- 
sheeted covering of a graph T = (V, E) can be assumed to be V x {1, . . . , n}, so 
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that the fiber above v £ V is {v} x {1, . . . , n}. For every edge e = (u, v) £ E, 
the fiber over e then constitutes a perfect matching between {v} x {l,...,n} 
and {u} x {1, ...,n}. This suggests a natural model for random n-coverings 
of the graph T. Namely, for every e £ E choose uniformly a random perfect 
matching (which is just a permutation in S n ). This model was introduced in 
|AL02| . and is a generalization of a well-known model for random regular graphs 
(e.g. |BS87) )F^1 Note that the model works equally well for graphs with loops 
and with multiple edges. 

In fact, there is some redundancy in this model, if we are interested only 
in isomorphism classes of coverings (two coverings are isomorphic if there is 
an isomorphism between them that commutes with the projection maps). It is 
possible to obtain the same distribution on (isomorphism classes of) n-coverings 
of r with fewer random permutations: one may choose some spanning tree T 
of r, associate the identity permutation with every edge in T, and pick random 
permutations only for edges outside T. 

We now fix some J <f g Fk, and consider random coverings of its core-graph, 
Tx (J)- We denote by Tx (J) a random n-covering of Tx (J), according to one 
of the models described above. If p : Fx (J) — > Fx (J) is the covering map, 
then Tx (J) inherits the edge orientation and labeling from Tx (J) via p . 
For every i (1 < i < n), we write Tx (J)^ for the core-graph Tx (J)/® ;) (the 

component of (®, i) in Tx (J) with basepoint (®, i)). 

By Claim [S~T1 (2]). each random n-covering of Tx (J) encodes a homomor- 
phism ctj <n £ Horn (J, S n ), via the action of J = ir* (Tx (J)) on the basepoint 
fiber. Explicitly, an element w £ J is mapped to a permutation aj. n (w) £ S n 
as follows: w corresponds to a closed path p w around the basepoint of Tx (J)- 
For every 1 < i < n, the lift of p w that starts at (<8>,j) ends at (<8>,j) for some 
j, and a J>n (w) (i) = j. 

By the correspondence of actions of J on and n-coverings of 

Tx (J), otj,n is a uniform random homomorphism in Horn ( J, S n ). This can 
also be verified using the "economical" model, as follows: choose some basis 
Y = {yi, . . . , y r k(j)} f° r J via a choice of a spanning tree T of Tx (J) and 
of orientation of the remaining edges, and choose uniformly at random some 
o r £ S n for every basis element y r . Clearly, otj. n 

We can now use the coverings of Tx (J) to obtain a geometric interpretation 
of &h,j, as follows: let H < J < Fk and 1 < i < n. Bv l5.1l( 4l). the morphism 
Vh^j '■ (H) — > T x (J) lifts to a core-graph morphism T x (H) — > T x {J) l iff 
H = it* (Tx (H)) fixes ((g), i) via the action of J on the fiber ® x {1, . . . , n}. 
Since this action is given by aj jn , this means that n^^j lifts to Tx (J\ ex- 
actly when aj, n (H) fixes i. Recalling that &h,.j (n) is the expected number of 
elements in {1, . . . , n} fixed by aj, n (H), we obtain an alternative definition for 
it: 



13 Occasionally these random coverings are referred to as random lifts of graphs. We shall 
reserve this term for its usual meaning. 
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Lemma 5.2. Let Tx {J) be a random n-covering space ofTx (J) in the afore- 
mentioned model from [AL021. Then, 

<&h,j ( n ) = The expected number of lifts of Vfe^j to Tx {J)- 

?x (J) 



T X (H)^^Tx (J) 

Note that this characterization of $>h,j involves the basis X, although the 
original definition (| 1 . 1 j) does not. One of the corollaries of this lemma is therefore 
that the average number of lifts does not depend on the basis X. 

Recall (Section 2]) the definition of the function L x , that satisfies $>h,j = 
J2 M e[H j] j for every H <^> J. It turns out that this derivation of $ also 

X 

has a geometrical interpretation. Assume that n^^j does lift to % : Tx (H) — > 
Fx (J)i- By Claim 12^41 % decomposes as a quotient onto Tx (M), where M — 
w x (imrji), followed by an embedding. Moreover, M lies in [H, J]^>. On the 
other hand, if there is some M £ [H, J] such that Tx (M) is embedded in 

(J) i then such M is unique and % lifts to the composition of ij^_^ M with 
this embedding. We obtain 

<&h,j (n) — Expected number of lifts of rjjj^j to Tx (J) 

= Expected number of injective lifts of rjfj^ j to Tx (J)- 

Me[H,J]^ 

x 

Taking the left derivations, we obtain: 

Lemma 5.3. Let M <-» J , and let Tx (J) be a random n-covering space of 
Tx (J) in the aforementioned model from JXL02]. Then, 

L^j j (n) — The expected number of injective lifts of n^^j to Tx (J)- 

fx (J) 

Tx (M) Tx (J) 

Unlike the number of lifts in general, the number of injective lifts does depend 
on the basis X. For instance, consider M — (x\X2) and J = (x\,X2) = F2. 
With the basis X — {xi,X2}, the probability that rj^^j lifts injectively to 
Tx (J)i equals (Lemma 15.41 shows how to compute this). However, with 
the basis Y = {x\X2 : X2}, the corresponding probability is — . We also remark 
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that Lemma 15.31 allows a natural extension of L x to pairs M, J such that M 
does not X-cover J. 

Lemma I531 allows us to generalize the method used in [Nic94 ( LPlOl lPudll| 
to compute the expected number of fixed points in a n (w) (see the notations 
before Theorem (|1.2I )). We claim that for n large enough, L* { j (n) is a simple 
rational expression in n. 

Lemma 5.4. Let M,J <f g such that M <-» J, and let r\ = rf^^j be the 
core-graph morphism. For large enough n, 



L M ,j («) = f? r~, (5.1) 

eeE(T x (J)) 

Here, "larqe enouqh n" is n > max t?" 1 (el (so that the denominator does 

~ e&E(r x {J)) 1 W| 

not vanish), and (n) r is the falling factorial n (n — 1) . . . (n — r + 1). 

Proof. Let v be a vertex in Fx (J) and consider the fiber rp 1 (u) in Tx (M). For 
every injective lift rj : Tx {M) <-}■ Tx (J), the fiber i]~ 1 (u) is mapped injectively 
into the fiber p" 1 (v). The number of such injections is 

(n)\ v -Hv)\ = n(n- 1) . . . (n- |J7 _1 («)I + 1), 



n w 1,-1(1, 



v(r x (M)) 



into F (P x (J)) is the 



and therefore the number of injective lifts of r\ 
numerator of (|5.1j) . 

We claim that any such injective lift has a positive probability of extending 
to a full lift of r\: all one needs is that the fiber above every edge of Tx (J) 
satisfy some constraints. To get the exact probability, we return to the more 
"wasteful" version of the model for a random n-covering of Tx (J), the model in 
which we choose a random permutation for every edge of the base graph. Let 
rj : V {Tx (M)) ^ V (t x (J)) be an injective lift of the vertices of T x (M) 
as above, and let e be some edge of Tx (J)- If rj is to be extended to r/ _1 (e), 
the fiber above e in Tx (J) must contain, for every (u,v) € vT 1 (e), the edge 
(rj(u) ,fj(v)) . 

Thus, the random permutation a £ S n which determines the perfect match- 
ing above e in Tx (J), must satisfy 1 77 1 (e)| non-colliding constraints of the 
form a (i) — j. Whenever n > 1 77 1 (e)| (which we assume), a uniformly ran- 
dom permutation in S n satisfies such constraints with probability 

1 



(™)|„-i(e)| 

This shows the validity of (|5.1j) . □ 
This immediately gives a formula for <&h,j as a rational function: 
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Corollary 5.5. Let H, J <f g F& such that H <-» J. Then, for large enough n, 

* BtJ (n)= E W0 = E (n) — • 

Since H X-covers every intermediate M £ [if, J] -» , the largest fiber above 
every edge of Tx (J) is obtained in Tx (H) itself. Thus, "large enough n" in the 
last corollary can be replaced by n > max (Vh^j) * ( e ) 

In fact. Corollary 15.51 applies, with slight modifications, to every pair of f.g. 
subgroups H < J: Lemma 15.21 holds in this more general case, that is &h,j 
is equal to the expected number of lifts of Tx (H) to the random n-covering 
Tx (</)• The image of each lift (with the image of (g> as basepoint) is a core 
graph which is a quotient of Tx (H), and so corresponds to a subgroup M such 
that H <-» M < J . In explaining the rational expression in Lemma 15.41 we did 
not need M to cover J. Thus, for every H < J, both finitely generated, 



n (n) 

, , y&v(r x (j)) 
®h,j (n) = 2^ 



Corollary 15.51 yields in particular a straight-forward algorithm to obtain a 
rational expression in n for &h.j (n) (valid for large enough n). For example, 
consider H = (xiX%Xi X2 / and F 2 = (xi,x 2 ). The interval [H : F 2 ]^ consists 
of seven subgroups, as depicted in Figure 12.41 Following the computation in 
Corollary 15.51 we get that for n > 2 (we scan the quotients in Figure [3^] top- 
to-bottom and in each row left-to- right): 

(n) 4 (n), (n) 2 



$ff,F 2 (n) 



(n) 2 (n) 2 (n) 2 (n) x (n) x (n) 2 

H 3 W 3 (^) 2 



+ 



(n) 2 (n) 2 W 2 (n) 2 (n) 2 (n) 2 (n) 1 



n- 1 



This exemplifies Theorem 11.51 and Table [T] for = (a;ix 2 x 1 1 x 2 1 ) (recall the 
discussion following Lemma 13.41 where it is shown that 7r (if) — 2 and that 
Crit(iJ) = {F 2 }). 

Remark 5.6. The discussion in this section suggests a generalization of our 
analysis to finite groups G other than S n . For any (finite) faithful G-set S, one 
can consider a random ^-covering of Tx (J)- The fiber above every edge is 
chosen according to the action on S of a (uniformly distributed) random element 
of G. In this more general setting we also get a one-to-one correspondence 
between Hom(Ffc,G) and 15*1 -coverings. Although the computation of L x and 
of $ might be more involved, this suggests a way of analyzing words which are 
measure preserving w.r.t. G. 
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6 The proof of Theorem 11.51 

The last major ingredient of the proof of our main result, Theorem 11.51 is an 
analysis of C x , the double-sided derivation of <F Recall Definition 12.61 where 
the X-distance px (H, J) was defined for every H, J <f g F k with H J. 

Proposition 6.1. Let M,N < fg F k satisfy M <^ N. Then 

Cm N (n) = 0[ - 

Section EH] is dedicated to the proof of this proposition. But before getting 
there, we want to show how it practically finishes the proof of our main result. 
We do this with the following final step: 

Proposition 6.2. Let H,N <f g F k satisfy H < atg N. Then 

Rh n (n) = -J — +0[ 

Proof. Let X be some basis of Ffe. Recall that R = C, x * C x , i.e. 

Rh,n (n) = C m,n ( n ) • 

Me[H,N]-, 

X 

For M — N we have C§ N (n) = R N ,N (") = $N,N (n) = n'^^ (the last 
equality follows from the fact that m independent uniform permutations fix a 
point with probability n~ m ). For any other M, i.e. M € [H,N)-», the fact 
that iV is an algebraic extension of H means that M is not a free factor of N 
and therefore, by Theorem (and (pjj)). p x (M, N) > rk(iV) - rk(M) + 1. 
Proposition 16. II then shows that 

Cm.^v (n) e O f - — ^ — — ) CO / ' 



n rk(M)+p x (M,Af) y - \ vn rk(JV) + l 



Hence, 

W = C* N (n) + ^ C X UN (n) = ^+0 (-J^) . 

X 

□ 

The proof of Theorem 11.51 is now at hand. For every H,J <f g F k with 
H < J, by and Proposition O 

= 51 Rh,n i n ) 

N:H< alg N<J 

= (n) + ^ iijj,jv (n) 

N:H-$ alg N<J 

V -JLW 1 

^ „rk(i\M 



r k(H) ^ r,rk(iV) V „rk(iV) + l 

N : H< alg N<J x 
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For J = Ffc we can be more concrete. Recall that the ^/-critical groups, Crit (H), 
are its algebraic extensions of minimal rank (other than H itself), and this 
minimal rank is ir(H). Therefore, 



$HF k M = -J—+ V -J— + o(~ 

N£(H,oo) alg 

1 , |Crit(*OI kQ ( 1 



rk(ff) n 7r( - H '> \ n"'( ff )+ 1 



This establishes our main results: Theorem ll.51 Theorem 11.21 and all their corol- 
laries. 



6.1 The Analysis of C 



x 

M,N 



In this subsection we look into C x , the double-sided derivation of $, and estab- 
lish Proposition 16. U which deals with the order of magnitude of C M N . Recall 
that by definition C x 



T x 



L x * fi x , which is equivalent to 
(VM < ? J 



JVe[M,j]_» 



x 

M.N 



(6.1) 



We derive a combinatorial meaning of C x { N from this relation. To obtain this, 
we further analyze the rational expression (|5.1[) for L x { 7 and write it as a 
formal power series. Then, using a combinatorial interpretation of the terms 
in this series, we attribute each term to some N £ [M, J]-., and show that for 
every N € [M, J]-», the sum of terms attributed to N is nothing but N . 
Finally, we use this combinatorial interpretation of C X j N to estimate its order 
of magnitude. 

Rewriting L x { 7 as a Power Series in n 1 

Consider the numerator and denominator of (I5.1[) : these are products of expres- 
sions of the type (n) . It is a classical fact that 



(n) r 



(-1)- 



where r . is the unsigned Stirling number of the first kind. That is, 

number of permutations in S r with exactly j cycles (see, for instance, [v 
Chapter 13). 



is the 
AYOT] . 



We introduce the notation 



r—3 



which is better suited for our 



purposes. The cycles of a permutation a £ S r constitute a partition P a of 
{l,...,r}. We define ||cr|| = \\P a \\ (recall (|2.4p ). and it is immediate that [r] ■ 
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is the number of permutations a £ S r with 1 1 a || = j. It is also easy to see that 
\\<j\\ is the minimal number of transpositions needed to be multiplied in order 
to obtain a. Therefore, [r] ■ is the number of permutations in S r which can be 
expressed as a product of j transpositions, but no less. In terms of this notation, 
we obtain 



The product of several expressions of this form, namely (n) ri {ri) r2 . . . (n) r , can 
be written as a polynomial in n whose coefficients have a similar combinatorial 
meaning, as follows. Let X be a set, and 99 : X — »• {1, . . . , £} some function with 
fibers of sizes \if~ 1 (i)\=r i (l<i<£). We denote by 

Sym v (X) = {a £ Sym (X) p o a = ip} 

the set of permutations a £ Sym (X) subordinate to the partition of X induced 
by the fibers of ip, i.e., such that tp (a (x)) = (p (x) for all x £ X. We define 

[X]* = \{aeS Y m v (X) : = j}\ , 

the number of (^-subordinate permutations with \\a\\ = j. Put differently, [X]J 
counts the permutations counted in [ \X \ ] which satisfy, in addition, that every 
cycle consists of a subset of some fiber of ip. With this new notation, one can 
write: 

I / n-i \ |X| 

(»)rx (")r 2 • ■ ■ « = II K* E Wm ) = XI ^ ^ 

j=l \ m =0 / j=0 

Turning back to (|5.1j) . we let Vm and 1?m denote the sets of vertices and 
edges, respectively, of Tx (M). We denote by 77 the morphism r) M ^.j, and use 
it implicitly also for its restrictions to Vm and Em, which should cause no 
confusion. We obtain 

\Vm\ 

n \v M \ £ (-iy[V M ]]n-i 
L M ,j(n) = - J= ° 



which by Claim I2.1t |2")) equals 



\E M \ 
J=0 



|Vm| 

E (-l^Mn--* 



(n) = n-^Jf. . (6.2) 

E (-i) J [s M ]>-^ 

i=o 
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Consider the denominator of (|6.2p as a power series Q (n^ 1 ). Its free coefficient 
is [Em}q = 1- This makes it relatively easy to get a formula for its inverse 
1/Q as a power series. In general, if Q (x) = 1 + X^tli a % x% ■> then 



oo / oo 



= E E (- 1 )* * • • • • • °j* 



'=0 ji ,j 2 ,~- j't>l 



In the denominator of (|6.2[) we have a% — ( — 1)' [£m]?, and the resulting ex- 
pression needs to be multiplied with the numerator ^}J=q (~^Y Wm]] n~* . In 
total, we obtain 



Lm.j (") = 

OO 

E E (-lr^*^-^-...-^]^-^- 2 ^* (6.3) 

t=0 io>o 



The Combinatorial Meaning and Order of Magnitude of Cj^ w 

The expression (j6.3[) is a bit complicated, but it presents L M j (n) as a sum 
(with coefficients ±n~ s ) of terms with a combinatorial interpretation: the term 
[Vm]J • [Em]] 1 ■ ■ ■ ■• [Em]] counts (t + l)-tuples of 77-subordinate permutations. 
The crux of the matter is that this interpretation allows us to attribute each 
tuple to a specific subgroup N £ [M, J]-». This is done as follows. 

Let (cto, <7i, . . . , a t ) be a (t + l)-tuple of permutations such that er G Sym^ (Vw) 
and CTi, . . . , cr t € Sym^ (.Em) \ {id} (we exclude id € Sym (Em), which is the 
only permutation counted in [Em]q)- Consider the graph T — T x( M )/ (a a ,... ,o- t ), 
which is the quotient of Tx (M) by all identifications of p airs of the form 
v,a (v) (v G Vm) and e, a\ (e) (e € £mj 1 < i < £]3 Since T is ob- 
tained from Fx (M) by identification of elements with the same 77-image, 77 
induces a well defined morphism T — > Tx (</)• Thus, every closed path in F 
projects to a path in Fx (J), giving (T) < (Tx (J)) = J- We denote 
TV = N a0!< r lt ... j( r t — n± (T). As usual (see Figures [2.2[ 12. 3p . we can perform 
Stallings foldings on T until we obtain the core graph corresponding to AT, 
T x (N). Obviously we have M <^ TV, and by Claim also N <^ J. 

14 For the definition of the quotient of a graph by identifications of vertices see the discussion 
preceding Figure 12.31 Although we did not deal with merging of edges before, this is very 
similar to merging vertices. Identifying a pair of edges means identifying the pair of origins, 
the pair of termini and the pair of edges. In terms of the generated core graph (see Section 
[2t . identifying a pair of edges is equivalent to identifying the pair of origins and\or the pair 
of termini. 
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Thus, we always have N — N aQtCri „_ tCrt G [M, J]-». To summarize the situation: 
T x (M) ^ r = ri(«)/^„,^ gX r x (JV) r x (J) (6.4) 

'IM-i-N 

Our next move is to rearrange (|6.3j) according to the intermediate subgroups 
iV € [M, J]-, which correspond to the tuples counted in it. For any N G [M, J]-* 
we denote by TJjJ^ j the set of tuples (<Jo, cti, . . . , <r t ) such that N CTOt(Jlt ___ m = N, 
i.e. 



Tju.jv,,/ = { ((T ,(Ti,...,at) 



t G N, Co G Sym,(V^) 
cti, . . . , er t G Sym^ (E M ) \ {id} 

The terms in (|6.3[) which correspond to a fixed iV G [M, J] thus sum to 



C^,j(N)= ( 1} T . (6-5) 

rk(M)+Elkdl 



i=0 

:T* .. . 

n 



(o-o,<xi,...,<xt)eT^ JVJ 



and ()6.3[) becomes 

l m,j= E ( 6 - 6 ) 

JVe[M,J]-. 



The equation (|6.6[) looks much like (|6.ip . with Cmj(N) playing the role of 
C X j N . In order to establish equality between the latter two, we must show that 
j (TV) does not depend on J. Fortunately, this is not hard: it turns out that 

&m,j (N) = C* tti (N) (VJV G [M, J] ? ) , (6.7) 

and the r.h.s. is, of course, independent of J. This equality follows from 
7~m n j = ^mjvtvj which we now justify. The only appearance J makes in 
the definition of T^j N j is inside 77 = rj^^j, which is to be <7i-invariant (for 
< i < n). i.e., a t must satisfy r) X I ^ J o(j l = Vm^j- If (00, ■ ■■ ,<?t) G T x lNJ then 
r lM^N OCJ i — Vm^n follows from the fact that Tx (N) is a quotient of r x( M )/{o-i). 
On the other hand, if (<Xo, . . . , <Xt) G TJjJ^v n then we have ij^^j^ o cr^ = ry^^^y, 
hence also (see 



Writing Cf / Ar d = :N (N), we have by (OH, (EH), and flfE?} 

C x * ( x = L x = C x * c x 
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which shows that C — C , as desired. 

We approach the endgame. Let (erg, 01, . . . , ai) € TJjf N j = T^j N N , and 
consider the partition P of V (Fx (H)), obtained by identifying v and v' when- 
ever (Tq (v) — v f , or <jj (e) = e' for some 1 < i < t and edges e, e' whose 
origins are u and u', respectively. Since P can clearly be obtained by X)i=o ll" 7 *!! 
identifications, we have ||P|| < X)t=o II ( a strong inequality can take place - 
for example, one can have o\ — 02). Since (ao,ai, . . . ,a t ) 6 N j we have 
Trf (rx(ff)/p) = AT, and thus by (j2~5]) we obtain 

t 

px (f,j)<||p||<£|H|. 

From (JH5J) (recall that C$ 7 (A r ) = C§ N = N ) we now have 

Cm,jv (n) - O ( n ~ k(M)+ 1 px(MAr) ) . 
and Proposition 16. II is proven. 



7 Primitive Words in the Profinite Topology 

Theorem 1 1 . 21 has some interesting implications to the study of profinite groups. 
In fact, some of the original interest in the conjecture that is proven in this 
paper stems from these implications. 

Let Ffc denote the profinite completion of the free group Ffc. A basis of Ffc 
is a set S C Ffc such that every map from 5* to a_j>rofinite group G admits a 
unique extension to a continuous homomorphism Ffc. — > G. It is a standard fact 
that Ffc is embedded in Ffc, and that every basis of Ffc is also a basis of Ffc 
(see for example [Wi l98jV An element of Ffc is called primitive if it belongs to 
a basis of Ffc . 

It is natural to ask whether an element of Ffc, which is primitive in Ffc, 
is already primitive in Ffc. In fact, this was conjectured by Gelander and 
by Lubotzky, independently. Theorem 11.21 yields a positive answer, as fol- 
lows. An element w G Ffc is said to be measure preserving if for any finite 
group G, and a uniformly distributed random (continuous) homomorphism 

aQ G Hom con t ^Ffc,G^, the image &q (w) is uniformly distributed in G. By 
the natural correspondence Hom co „t ^Ffc, G^j = Horn (Ffc, G), an element of Ffc 

is measure joreserving w.r.t. Ffc iff it is so w.r.t. Ffc. As in Ffc, a primitive 
element of Ffc is easily seen to be measure preserving. Theorem 11.21 therefore 
implies that if w £ Ffc is primitive in Ffc, then it is also primitive in Ffc. In 
other words: 

Corollary 7.1. Let P denote the set afprimitive elements o/Ffc, and let P 
denote the set of primitive elements o/Ffc. Then 

P = Pf)F k . 
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As P is a closed set in F&, this immediately implies Corollary 11.31 which 
states that P is closed in the profinite topology. 

This circle of ideas has a natural generalization. Observe the following five 
equivalence relations on the elements of F& : 

A 

• W\ ~ W2 if w\ and u>2 belong to the same Aut F^-orbit. 

• w\ ~ t«2 if ifi and belong to the same Aut Ffc-orbit (where Aut Ffc is 
the closure of Aut Ffc in Aut Ffc). 

c — 

• w\ ~ u>2 if lUi and W2 belong to the same Aut Ffc-orbit. 

G' 

• w\ ~ u>2 if wi and u>2 have the same "statistical" properties, namely if 
they induce the same distribution on any finite, group. 

c" 

• w\ ~ W2 if the evaluation maps ev Wl , ev W2 : Epi (Fk , G) — > G have the 
same images for every finite group G. 

It is not hard to see that (A) => (B) =► (C) =^> (C) ^ (C") (namely, that if 

w\ ~ ui2 then wi ~ W2, and so on). The only nontrivial implication is (C) => 
(C"), which can be shown by induction on the size of G. In an unpublished 
manuscript, C. Meiri gave a one-page proof that (C), (C) and (C") in fact 
coincide (in fact, these three coincide for all elements of Ffc). 

From this perspective, our main result shows that in the case that wi is 
primitive, all five relations coincide, and it is natural to conjecture that they in 
fact coincide for all elements in FiF^I. Showing that (A) <= (B) would imply 
that Aut Ffc-orbits in Ffc are closed in the profinite topology, and the stronger 
statement (A) <= (C) would imply that words which lie in different Aut Ffc- 
orbits can be told apart using statistical methods. 

The analysis which is carried out in this paper does not suffice to answer 
the general case. For example, consider the words w\ = x\X2X\X2 l and W2 = 
X\X2X\ %2 ■ They belong to different Aut F2-orbits, as W2 € F' 2 but w\ 
F 2 , but induce the same distribution on S n for every n: their images under 
a random homomorphism are a product of a random permutation (<r) and a 
random element in its conjugacy class (tctt -1 for u>i, and rer^V -1 for w%). 

These questions also play a role in the theory of decidability in infinite 
groups. A natural extension of the word-problem and the conjugacy-problem, is 
the following automorphism-problem: given a group G generated by S, and two 
words wi,W2 6 F (S), can it be decided whether W\ and W2 belong to the same 
Aut G-orbit in Gl Whitehead's algorithm |Whi36a[ Wh i36b] gives a concrete 
solution when G = Ffc. Showing that (A) ■<= (B) would provide an alternative 
decision procedure for Ffc. 

More generally, and in a similar fashion to the conjugacy problem, it can be 
shown that if 

15 In [AVll , for example, the authors indeed ask whether (C) (A). 
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(1) G is finitely presented 

(2) Aut G is finitely generated 

(3) Aut G-orbits are closed in the profmite topology 

then the automorphism-problem in G is decidable. For the free group (1) and 
(2) are known, and (3) is exactly the conjectured coincidence (A) (B). 

8 Open Problems 

We mention some open problems that naturally arise from the discussion in this 
paper. 

• Section [7] shows how the questions about primitive elements can be ex- 
tended to all Aut Ffc-orbits in Ffe (is it true that (A) O (£?), and even the 
stronger equivalence (A) <£4> (C)?). More generally, can statistical prop- 
erties tell apart two subgroups i?i,i?2 </g Ffe which belong to distinct 
Aut Ffc-orbits? This would be a further generalization of Theorem 11.21 

• It is also interesting to consider words which are measure preserving w.r.t. 
other types of groups. For instance, does Theorem 11.21 still hold if we 
replace "finite groups" by "compact Lie groups"? Within finite groups, we 
showed that measure preservation w.r.t. S n implies primitivity. Is it still 
true if we replace S n by some other infinite family of finite groups (e.g. 
PGL n ( g ))? 

• Is it true that 

[ff,oo)< = |J [if, 00)3. 

X is a 
basis of F ^ 

and under which assumptions does the following hold 

[ ff .°°u= n ^°°h 

X is a 
basis of F fe 

(see Remark |3 . 3[) ? 

• Finally, there are interesting questions regarding the distribution of the 
primitivity rank. It is known that primitive words are rare (in the sense 
that the proportion of primitive words among all words of length t tends 
to as t — > 00, [BMS02J). Is it true, for example, that being of primitivity 
rank k (it (w) = k) is generic, in the sense that the proportion of such 
words tends to 1 as t — > 00? In addition, almost primitive words are 
words w £ F fe such that [(w) , oo) a , g = {(w) , F fe } [FRSS98] , What is the 
proportion of these words inside the class of words with primitivity rank 
fc? 
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Glossary 







Reference 


Remarks 


H <fg F k 


finitely generated 






H < J 


free factor 






H <alg J 


algebraic extension 


Definition 11.71 




H J 

— X 


H X-covers J 


Definition E3l 


H -» J in |Pudll| 




{L | H ^ L ■< J} 




< is either one of 

<,<,<alg Or ^ 

(standing for <-» ) 




{L\H r< J} 




[H,oo)^ 


{L\H ^L} 






the A-fringe of H 




Ox (H) in [MVW07J 




algebraic extensions of H 




AE (H) in |MVW07| 


n(H) 


primitivity rank of H 


Definition 11.41 




Grit (H) 


iJ-critical groups 




r x (H) 


X-labeled core graph of 
H 






Px (H, J) 


A-distance 


Definition El 


H J 

— X 




the morphism 
T x (H) -> T x (J) 


Claim O 


H < J 


CtJ,n 


a uniformly chosen 
random homomorphism 
in Horn (J,S n ) 




J <fg F k 


3>h,j (n) 


the expected number of 
common fixed points of 
aj, n (H) 


cm 


H < J 
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